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Summary. We review the concepts of the Voronoi binning technique [P, which 
optimally solves the problem of preserving the maximum spatial resolution of gen- 
eral two-dimensional data, given a constraint on the minimum signal-to-noise ratio 
(S/Af). This is achieved by partitioning the data in an adaptive fashion using a 
Voronoi tessellation with nearly hexagonal lattice. We review astrophysical appli- 
cations of the method to X-ray data, integral-field spectroscopy, Fabry- Perot inter- 
ferometry, N-body simulations, standard images and other regularly or irregularly 
sampled data. Voronoi binning, unlike adaptive smoothing, produces maps where 
the noise in the data can be visually assessed and spurious artifacts can be recog- 
nized. The method can be used to bin data according to any general criterion and 
not just S/J\f. It can be applied to higher dimensions and it can be used to generate 
optimal adaptive meshes for numerical simulations. 



1 Introduction 

In scientific research one often wants to measure a physical quantity at a 
certain set of positions in two (or more) dimensions. The quantity under 
exam can vary by many orders of magnitude between the different locations. 
When the quantity represents the intensity of a signal 5, its ratio with the 
corresponding noise Af will correspondingly vary. In general one wants to 
perform subsequent analysis on the recorded signal, for example to extract 
derived quantities, to compare the observations with model predictions or to 
better visualize and interpret the observed signal. In these cases it is useful to 
adaptively bin (or partition, or segment) the data to preserve the maximum 
spatial resolution, given a constraint on the required minimum S/M. 

Adaptive binning should not be improperly called or confused with adap- 
tive smoothing [2], which consists of correlating neighbouring information 
O m [5l [6l [7j . Smoothing is not ideal when an accurate error analysis is needed, 
it does not allow for a visual assessment of the noise level in the data and it 
makes spurious artifacts difficult to recognize. A comparison between Voronoi 
binning and adaptive smoothing is presented in 
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Fig. 1. contour plots for an idealized physical model. The blue dots represent 
the measurements for the two model parameters (xi,X2) for different random real- 
izations of the data. The large red circle is the average of the realizations, while the 
green cross marks the minimum of ■ Left panel: At high-tS/A/" all measurements lie 
close to the minimum, where the x^ is well approximated by a quadratic form and 
the average of the realizations agrees with the true minimum. Right panel: At low- 
S /M the measurements can explore higher levels of where the contours become 
asymmetric. In this case the average solution will be biased. 

2 Why is it useful to bin the data? 

The main theoretical motivation for binning is to reduce the bias that can be 
introduced, in the low-^/A/" regime, when fitting a general nonlinear physical 
model to the data. The x^i which describe the agreement between data and 
model, is in general approximated by a quadratic form near the minimum, but 
far from it may become asymmetric (unless the model is fully linear, which 
is rarely the case in practice). An illustration of the situation is presented 
in Fig. [TJ When the data have high S/M the perturbed solutions stay close 
to the minimum and the average solution is unbiased. But at low S/M the 
perturbed solutions can explore higher levels of x^, where the contours become 
asymmetric. In this case the average solution does not agree with the true 
minimum and is biased. Binning the data to increase the S/J\f and fitting the 
model to them can still provide an unbiased solution to this problem. 

3 How to define the optimal binning? 

Earlier attempts to bin two dimensional data relied on recursive methods. A 
classic one is the Quadtree [10 which recursively partitions a region into axis- 
aligned squares. A variation was proposed by [llj and applied to the adaptive 
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Fig. 2. Left panel: A honeycomb consists of a lattice of hexagonal cells. Right 
panel: A simulated cluster of bubbles (from [9 ). The bubbles adapt their size to the 
enclosed amount of air, while keeping a compact (minimum surface) configuration. 
This constitutes an ideal physical model for an adaptive binning method in two and 
three dimensions. 



binning of X-ray images. The limitation of recursive methods is that they can 
only employ predefined sizes for the bins, which are powers of a basic element. 
This necessarily produces discontinuities in the S/JV and artifacts. To improve 
on recursive methods [Ij identified three criteria for the ideal adaptive binning: 

Topological: The data have to be partitioned without holes or overlapping 

bins: One wants to use all available data, without duplications; 
Morphological: The bins should be as compact or round as possible: One 

wants to preserve the maximum spatial resolution; 
Uniformity: The scatter in S/JV should be minimized: One wants to increase 

the S/JV to the required target level {S/JV)t^ but not above that, as that 

would degrade the spatial resolution. 

The criteria are quite general. They define the goal to partition the plane in 
a set of nearly circulai0 and maximally-packed regions enclosing equal amount 
of a certain quantity. This generality is one of the reasons why the criteria 
happen to be satisfied in a number of physical phenomena in Nature. An 
optimal partition of the plane is naturally produced in honeycombs, which 
consist of equal-size nearly hexagonal cells (Fig. [2]). Similar partitions with 
hexagonal lattice are naturally generated in convective cells, for example in 
the granulation of the solar photosphere. In these cases however the physical 
conditions that drive the plane partition are spatially constant and the cells 
are all equal. A beautiful example where an adaptive partition of space is 
naturally produced by an optimality criterion, in that case the minimization 

^ An alternative is discussed in the Contour Binning method [12 . 
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Fig. 3. The bin-accretion stage. Left panel: Tessellation generated by the bin- 
accretion stage of the Voronoi binning method for an image with about 350,000 
pixels, grouped into 3000 bins. The size of the bins adapts to the S/N" while keep- 
ing a compact shape. Right panel: The S/J\f of the original pixels is shown with 
the red points and the (<S/A/')bin of the bins with the open squares. The target 
(<S/A/')t = 20 is indicated by the solid line. 

of the surface, is provided by the configuration of a cluster of soap bubbles 
(Fig.[2|). This phenomenon constitutes an ideal physical model for the optimal 
binning method in two or three dimensions. 

4 The Voronoi binning method 

To simplify the terminology in what follows we call 'pixel' an individual data 
element. This is not just the pixel of an uniformly sampled image but repre- 
sents a generic quantity which is measured at a certain location identified by 
two (or more) coordinates. In applications to integral- field spectroscopy, for 
example, a pixel refers to a full spectrum measured at one sky location. 

The Voronoi binning method consists of two separate stages: (i) The bin- 
accretion stage, which generates an initial set of bins satisfying the criteria of 
Section [3] and (ii) an optional regularization stage, which tries to improve the 
binning produced at the previous stage. Here we summarize the main ideas 
of the method. A detailed description of the technique, including essential 
details, is given in [Ij. A reference implementation is available onlin^l. 

4.1 The bin-accretion stage 

The Voronoi binning method starts by trying to generalize the process used 
to bin one-dimensional data (e.g. [13j). In one dimension one starts from the 
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pixel with the highest S /J\f and keeps 'accreting' neighbouring pixels, until the 
target (5/A/')t is reached. In two (or more) dimensions one can still accrete 
pixels that are closest to the current bin, however there will be many adjacent 
pixels and one has to decide which one to accrete next. Moreover the S /M 
criterion is not sufficient to decide when to start a new bin. The bin shape 
also plays a role as the bin may become too elongated e.g. when the edge of 
the data is reached. The bin- accretion stage of the Voronoi binning algorithm 
solves these problems with two simple choices: 

1. The next pixel to accrete into the current bin is the closest to its centroid; 

2. The accretion of new pixels into a bin continues as long as the criteria for 
the optimal binning of Section [3] are satisfied. 

This process continues, pixel-by-pixel, until all pixels in the image have been 
processed. At the end of the process however, some pixels will generally remain 
unbinned and a few bins will not have reached the desired {S/M)t' The 
method computes the geometric centroid Zj of all good bins and these are 
used as generators for a Voronoi tessellation (VT; see [H]), which partitions 
the data into convex bins Vj {Voronoi regions): The pixel is assigned to 
the bin that minimizes the distance di = \\zj — x^||. 

An example of binning produced by the bin-accretion stage of the method 
is shown in Fig. [3l The bins tend to the desired hexagonal lattice, but their 
size adapts to the S/JV in such a way that they have nearly the same S/JV. A 
possible choice to estimate the S/M oi a bin during the accretion is to define 

(1) 

where the summation is done over the pixels in that bin. However the bin- 
accretion algorithm is completely independent on the way in which {S/Af)hin 
is estimated! 

In some applications the noise of the individual pixels may be correlated, so 
that equation ([T]) is not accurate. Or it may be possible to determine {S/JV)hin 
only when it is sufficiently high. For example when binning clumpy gas in a 
galaxy the Si in many pixels may be undetected before binning. Alternatively 
one may want to bin the data not according to (tS/A/'jbin, but according to 
the errors in an extracted quantity (e.g. kinematics) from the bin. All these 
situations can be trivially included in the bin-accretion algorithm by replacing 
equation ([T]) with a procedure to estimate (^/A/'jbin, or any other quantity of 
interest, during the bin accretion. 

Due to its generality of estimating the S/Af in a bin, and due to its ability 
to produce small scatter in the S/M^ also in the few-pixels regime, this bin- 
accretion stage may be all that one needs to Voronoi bin the data. Optionally 
however one may start from the binning generated by this stage and further 
improve it via some iterative process. We call this the bin-regularization stage, 
which is the subject of the next section. 
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4.2 The bin-regularization stage 

In the two important cases where (i) the noise is approximately Poissonian 
(shot noise), or (ii) when the pixels in each bin are optimally weighted with 
weights w — Sj/Mj to maximize (5/A/')bin ^3^, then equation ([T]) reduces 
to («5/A/')^in = 1-^)1' One can then define a density distribution p(y) = 
(«S/A/')^(y) such that the problem of binning to a constant S/J\f reduces to 
that of obtaining a tessellation enclosing equal mass according to p. In that 
case a Centroidal Voronoi Tessellation (CVT) can be used to regularize the 
binning provided by the bin- accretion stage. 

Given a density function p(r), a CVT is a special class of VT where the 
generators Zj happen to coincide with the mass centroids 

_ A.yp(y)dy 

of the corresponding Voronoi regions Vj (see review [E]). A striking char- 
acteristic of the CVT in the two-dimensional case is its ability to partition 
a region into bins whose size varies as a function of the underlying density 
distribution, but whose shape tends asymptotically to an hexagonal lattice 
for a large number of bins. Another nice feature of the CVT is that a sim- 
ple algorithm exists for its practical computation, Lloyd's method [18 , which 
simply consists of a repeated application of equation ([2j). In the statistical 
literature (e.g. [19j) discrete CVTs are related to optimal /c-means clusters, 
where Voronoi regions and centroids are referred to as clusters and cluster 
centers, respectively. 

Although CVT bins are naturally smaller where the density is higher, 
the mass Mj = Ajpj enclosed in a bin, where Aj is the bin area, is not 
constant. However ^ demonstrated that if the CVT is constructed for the 
density = p^, then Lloyd's algorithm converges toward an equal-mass CVT 
according to p. 

An alternative to the regularization stage using the equal-mass CVT has 
been proposed by [8j. It consists of using a Weighted Voronoi Tessellation 
(WVT; [20]), which is a generalization of a VT in which bins are not uniquely 
defined by the generators, but also have an associated scale Sj. The Lloyd's 
iteration of the CVT scheme is modified as follows: (i) A pixel with coordinates 
is assigned to the bin that minimizes the scaled distance di = \\zj — x^H/Jj, 
producing a WVT (where Sj = 1 reduces to the standard VT); (ii) The 
density is set to a constant p{y) = 1, so that z* in equation ([2]) becomes 
the geometric centroid; (iii) The scale of each bin during Lloyd's iteration is 
updated as Sj = Aj / {S / JV)hin' Clearly a binning with constant S/JV across 
the field is a stable fixed point of this iteration scheme, as it satisfies the 
relation Sj/Sk = ^ Ajj A]^ (see [8 for details). The advantage of regularizing 
the bins using a WVT instead of a CVT, is that the former, like the bin- 
accretion algorithm, allows for a completely general form of the noise. The 
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Fig. 4. Voronoi binning of X-ray data: Left Panel: Projected emission- weighted tem- 
perature map of the core of the Perseus cluster from a 191 ks Chandra observation. 
Right Panel: Corresponding map of abundance. (From [23]). 



bins also tend to be more compact at the edge of the field. A generally minor 
disadvantage is that the resulting binning is not a standard VT: The scales 
Sj are required to describe the bins, together with the generators zj and the 
bins are not necessarily convex. 

As a practical advice, in most situations the bin-accretion algorithm of 
Section [411 produces a binning that already closely satisfies the optimal crite- 
ria of Section[3](Fig.[3|). If one wants the binning to be described by a VT, and 
if the noise is approximately Poissonian (or the pixels are optimally weighted) , 
one can apply the CVT regularization stage. In the other cases one may just 
skip the regularization stage or apply a WVT regularization. All these options 
are available in the reference implementation^ and are quick to test on the 
data at hand. An implementation adapted by I8j, including specific examples 
of usage on X-ray data, is also available onlincl- 

The size of the equal-mass CVT can be adapted to any desired density 
distribution while still preserving the nearly hexagonal lattice. The same is 
true for the WVT. This makes the algorithm an excellent mesh generator if 
one considers that the dual of a Voronoi tessellation with nearly hexagonal 
bins is a Delaunay triangulation with nearly equilateral triangles [H [21] . A 
recent astrophysical illustration of the power of this general idea is given by 
the mesh regularization scheme in the AREPO cosmological simulation code 
[22]. 



http: / / www.phy.ohiou.edu / ^diehl /WVT / 
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Fig. 5. Voronoi binning of X-ray data: Top left: Chandra image of Cassiopeia A with 
an exposure time of 1 ks. Top right: the same data adaptively binned to {S/JV)t = 5. 
Bottom left: Cassiopeia A with the full exposure of 50 ks. Bottom right: the same 
image binned to {S/JV)t 20. (From [8^). 

5 Applications of the Voronoi binning method 

The Voronoi binning method was introduced and discussed in detail in [Ij, 
where it was illustrated on images and integral-field spectroscopic data. The 
method however was designed to be applicable to general two (or more) di- 
mensional data. In this section we review some representative astrophysical 
applications of the method since it was introduced. 

X-ray: The first scientific application of the Voronoi binning method was 
presented by ^ . The authors used the bin-accretion algorithm to produce a 
colour image from deep Chandra X-ray observation of the cluster environment 
of radio galaxy 3C 294 at redshift z ^ 1.8. The bin-accretion method was sub- 
sequently used to construct a temperature map from a now classic Chandra X- 
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Fig. 6. Voronoi binning of integral-field spectroscopy: Top Panels: Mean velocity 
V and velocity dispersion a of the stars in NGC 4459 (from [38 ). Bottom Panels: 
Mean velocity and equivalent width (EW) of the ionized gas in NGC 4278, as traced 
by the [OIII] AA4959,5007 emission lines (from [39j). 



ray 191 ks observation of the "shocks and ripples" at the center of the Perseus 
cluster [25 . The authors binned the X-ray data-cube by requiring at least 
2000 counts per bin {{S/JV)t ^ 45). The binned X-ray spectra were fitted 
with models, while varying the temperature, abundance and column density. 
A clear visualization of the temperature map from the same data, together 
with the abundance structure, was also presented by [23j and is reproduced 
in Fig. m It shows that the morphology of the abundance map follows the 
temperature map closely. An incomplete list of additional applications of the 
Voronoi binning method to X-ray data from the Chandra and XMM-Newton 
satellites includes [8, 26j|27l[2i[23[30l[ai[3llMlMl.35, 36, 37J. An excellent 
example of the usefulness of Voronoi binning for the visualization of X-ray 
data is reproduced in Fig. [5l 
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Fig. 7. Voronoi binning of Fabry-Perot interferometry: Top left: Unbinned mean 
gas velocity in the spiral galaxy NGC 5055. Top Right: Smoothed velocity with 
a Gaussian kernel of 6x6 pixels. Bottom left: Adaptively binned to (<S/A/')t = 7. 
Bottom right: Bin sizes of the Voronoi binning. (From [58]). 

Integral-field spectroscopy: The Voronoi binning method was used to 
analyze integral-field spectroscopic data from the SAURON survey [40j which 
produced - 250, 000 SAURON/OASIS spectra for 72 galaxies. The data were 
spatially binned to a {S/M)t ^ 60, required to reliably extract the stellar 
kinematics [38, 41 . The same spectra in each bin were then used to also 
extract the intensity and kinematics of the emission line ionized gas [39l [42] 
and the stellar absorptions line-strength [43"^ ^44] (Fig. [6]). Some applications by 
other groups to integral-field data from other spectrographs, include VIMOS 
[45J and SINFONI [H [47] on VLT, CMOS [H [49] and NIFS [50l |51] on 
Gemini, DensePak at WIYN [52], MPFS on the Russian 6-m SAO [53l|5l]. 

Dynamical modeling: The use of binned SAURON data was crucial 
also to reduce the size of the data in the dynamical modeling of the stellar 
kinematics for the sample [55} [56]. Both the computation time and the memory 
usage in Schwarzschild's orbit-superposition method [57J scale linearly with 
the number of data points. The binned data typically had a size ~ 5x smaller 
than the original data. 

Fabry-Perot interferometry: Voronoi binning was extensively used to 
extract the Ha emission line gas flux and kinematics from Fabry-Perot inter- 
ferometric observations of spiral galaxies [59l [60l [611 [621 [63l [BH [65] [66] [571 EH] • 
In this case, due to patchy nature of the observed gas flux, the Sj could not 
be reliably pre-computed before the binning. The {S/M)un of the gas was 
determined by replacing equation ([T]) with a direct estimation of {S/M)un 
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Fig. 8. Voronoi binning of N-body simulations: Top panels: From left to right are 
shown the surface brightness, the mean velocity and the velocity dispersion of a 
simulated merger remnant. Bottom panels: Same as in the top panels after adaptive 
binning to (<S/A/')t = 10 (from [69j). 



from the spectra during the bin-accretion stage [58j. Fig. [3 is a good example 
of the success of this approach. 

N-body simulations: In the context of the ATLAS^^ projeclEI, Voronoi 
binned N-body simulations of galaxy mergers have been used by ^ to better 
visualize the kinematics of the remnants in the outer regions and compare 
them directly with integral- field observations (Fig. [8]). 

Standard images: Photometric studies of galaxies generally adopt ellipse- 
fitting techniques to analyze radial surface brightness profiles and colours of 
galaxies (e.g. [T^)- However in cases where a symmetry along ellipses is not 
desired, one can Voronoi bin the images to study faint features in the data. 
Binning was used by [73l [TH [75] to study galaxy colours from HST/ACS ob- 
servations of galaxies at intermediate redshift {z ~ 0.5). It was employed to 
look for star formation in the intracluster light of the Spiderweb system ([70], 
Fig-E]), to study colours of stellar shells in elliptical galaxies merger remnants 
[76) and to produce colour maps of early- type galaxies [771 • indepen- 
dent implementation of the Voronoi binning method, which uses a different 
bin-accretion algorithm, but the same CVT regularization stage, was applied 
by [7T] to study the characteristics of Jupiter's UV auroral emission from 
HST/ACS images (Fig. ED . 

Star counts: In [79j Voronoi binning was used to group stars counts, as 
observed with HST/ACS, in the halo of the spiral galaxy NGC 891, an edge-on 
galaxy considered to be an analogue of the Milky Way. A colour-magnitude 



^ http://purl.org/atlas3d 
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Fig. 9. Voronoi binning of images: Left panel: Slope of the UV spectrum of galaxies 
and intergalactic light of the Spiderweb system (from HST/ACS images). Galaxies 
are outlined in black. The Lya flux peaks on the galaxy that lies ~ 3 arcsec west 
(x ^ 11^^ arcsec in the figure) of the radio galaxy, which has the bluest colour 
compared to the other satellite galaxies (from [70^). Right panel: Image extracted 
from the polar projection of Jupiter's UV auroral emission from an HST/ACS image 
(from [n]). 

diagram was constructed for the stars in each bin to study the variation of the 
metallicity in the halo. This allowed the authors to trace metallicity gradients 
at extremely faint surface brightness levels fFig. [TQ|). 

General irregular data: To conclude this overview of applications of 
the Voronoi binning method, we show in Fig. [11] the usage of the technique to 
increase the ^/A/" of a generic map of two-dimensional quantities, not arranged 
in a regular grid. Here the coordinates represent the largest two principal 
components (PC), as obtained from PC analysis of a large set of spectra from 
the Sloan Digital Sky Survey. The figure quantifies dust effects as a function 
of the two PC, which are closely related to the strength of the 4000 A break 
and to the equivalent width of the Balmer US absorption line. 



6 Conclusion 

We summarized the motivations and criteria for an optimal binning of two- 
dimensional data. The ideal binning consists of a Voronoi tessellation de- 
scribed by a lattice of nearly hexagonal bins, which adaptively change their 
size according to the S/Af of the data. We described the main concepts of the 
Voronoi binning method, an algorithm that produces a tessellation satisfying 
the desired optimal criteria [1]. 
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Fig. 10. Voronoi binning of star counts: Top panel: The spatial distribution of point- 
like sources from three HST/ACS fields is superimposed on the K-band 2MASS 
image of NGC 891. Bottom panel: Voronoi binned map of the median metallicity 
over the ACS survey area. A large-scale gradient in the halo region is noticeable, as 
well as a more metal-rich thick disk (from [79^). 
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Fig. 11. Voronoi binning of irregular data: Left panel: The volume- weighted mean z- 
band dust attenuation of galaxies in each joint PC1/PC2 histogram bin. Right panel: 
The volume- weighted mean Balmer decrement of galaxies in each bin. A single burst 
track of 1% burst fraction is overplotted. (From [80 ). 
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We reviewed the main applications of the method in astrophysics, in par- 
ticular to X-ray data, integral-field spectroscopy, Fabry-Perot interferometry, 
N-body simulations, standard imaging and other general data. The simplicity 
and generality of the method makes it useful in other fields of science and in 
the related problem of adaptive mesh generation. No changes to the original 
concept are required to use the method in three dimensions (e.g. [81j). We 
trust that in the future, as it happened in the past, we will see new applica- 
tions of the technique beyond anything we had anticipated. 
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